Selection of Reliable Likelihood Ratios for Statistical Model-Based Voice Activity Detection
نویسندگان
چکیده
A statistical model-based voice activity detection (VAD) is a robust algorithm in noisy condition to detect speech region from input signal by speech and non-speech statistical model such as complex Gaussian probability density function (PDF). The decision rule used in this VAD is based on Bayes’ rule and considers likelihood ratios (LRs) in whole frequency region. In this VAD, however, the Bayes’ rule may cause a decision error. With the statistical model, we analyze why this problem happens and show how we can decrease the decision error by using the LRs at selected frequency bins having relatively high spectral power in each frame. The performance of this VAD is evaluated by receiver operating characteristic (ROC) curves and summarized in a table, and the results from proposed methods show better performances than those of typical statistical model-based VAD.
منابع مشابه
Reliable likelihood ratios for statistical model-based voice activity detector with low false-alarm rate
The role of the statistical model-based voice activity detector (SMVAD) is to detect speech regions from input signals using the statistical models of noise and noisy speech. The decision rule of SMVAD is based on the likelihood ratio test (LRT). The LRT-based decision rule may cause detection errors because of statistical properties of noise and speech signals. In this article, we first analyz...
متن کاملA statistical model-based voice activity detection employing minimum classification error technique
In this paper, we apply a discriminative weight training to a statistical model-based voice activity detection (VAD). In our approach, the VAD decision rule is expressed as the geometric mean of optimally weighted likelihood ratios (LRs) based on a minimum classification error (MCE) method. That approach is different from that of previous works in that different weights are assigned to each fre...
متن کاملPartial mutual information based input variable selection for supervised learning approaches to voice activity detection
The paper presents a novel approach for voice activity detection. The main idea behind the presented approach is to use, next to the likelihood ratio of a statistical model-based voice activity detector, a set of informative distinct features in order to, via a supervised learning approach, enhance the detection performance. The statistical model-based voice activity detector, which is chosen b...
متن کاملVoice Activity Detection Based on Discriminative Weight Training Incorporating a Spectral Flatness Measure
In this paper, we present an approach to incorporate discriminative weight training into a statistical model-based voice activity detection (VAD) method. In our approach, the VAD decision rule is derived from the optimally weighted likelihood ratios (LRs) using a minimum classification error (MCE) method. An adaptive online means of selecting two kinds of weights based on a power spectral flatn...
متن کاملVoice activity detection based on statistical models and machine learning approaches
The voice activity detectors (VADs) based on statistical models have shown impressive performances especially when fairly precise statistical models are employed. Moreover, the accuracy of the VAD utilizing statistical models can be significantly improved when machine-learning techniques are adopted to provide prior knowledge for speech characteristics. In the first part of this paper, we intro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012